56 research outputs found

    Generating and applying textual entailment graphs for relation extraction and email categorization

    Get PDF
    Recognizing that the meaning of one text expression is semantically related to the meaning of another can be of help in many natural language processing applications. One semantic relationship between two text expressions is captured by the textual entailment paradigm, which is defined as a relation between exactly two text expressions. Entailment relations holding among a set of more than two text expressions can be captured in the form of a hierarchical knowledge structure referred to as entailment graphs. Despite the fact that several people have worked on building entailment graphs for different types of textual expressions, little research has been carried out regarding the applicability of such entailment graphs in NLP applications. This thesis fills this research gap by investigating how entailment graphs can be generated and used for addressing two specific NLP tasks: First, the task of validating automatically derived relation extraction patterns and, second, the task of automatically categorizing German customer emails. After laying a theoretical foundation, the research problem is approached in an empirical way, i.e., by drawing conclusions from analyzing, processing, and experimenting with specific task-related datasets. The experimental results show that both tasks can benefit from the integration of semantic knowledge, as expressed by entailment graphs

    Regularized inner products and errors of modularity

    Get PDF
    We develop a regularization for Petersson inner products of arbitrary weakly holomorphic modular forms, generalizing several known regularizations. As one application, we extend work of Duke, Imamoglu, and Toth on regularized inner products of weakly holomorphic modular forms of weights 0 and 3=2. These regularized inner products can be evaluated in terms of the coefficients of holomorphic parts of harmonic Maass forms of dual weights. Moreover, we study the errors of modularity of the holomorphic parts of such a harmonic Maass forms and show that they induce cocyles in the first parabolic cohomology group introduced by Bruggeman, Choie, and the second author. This provides explicit representatives of the cohomology classes constructed abstractly and in a very general setting in their work

    multi level alignments as an extensible representation basis for textual entailment algorithms

    Get PDF
    A major problem in research on Textual Entailment (TE) is the high implementation effort for TE systems. Recently, interoperable standards for annotation and preprocessing have been proposed. In contrast, the algorithmic level remains unstandardized, which makes component re-use in this area very difficult in practice. In this paper, we introduce multi-level alignments as a central, powerful representation for TE algorithms that encourages modular, reusable, multilingual algorithm development. We demonstrate that a pilot open-source implementation of multi-level alignment with minimal features competes with state-of-theart open-source TE engines in three languages

    Consensus Paper: Radiological Biomarkers of Cerebellar Diseases

    Get PDF
    Hereditary and sporadic cerebellar ataxias represent a vast and still growing group of diseases whose diagnosis and differentiation cannot only rely on clinical evaluation. Brain imaging including magnetic resonance (MR) and nuclear medicine techniques allows for characterization of structural and functional abnormalities underlying symptomatic ataxias. These methods thus constitute a potential source of radiological biomarkers, which could be used to identify these diseases and differentiate subgroups of them, and to assess their severity and their evolution. Such biomarkers mainly comprise qualitative and quantitative data obtained from MR including proton spectroscopy, diffusion imaging, tractography, voxel-based morphometry, functional imaging during task execution or in a resting state, and from SPETC and PET with several radiotracers. In the current article, we aim to illustrate briefly some applications of these neuroimaging tools to evaluation of cerebellar disorders such as inherited cerebellar ataxia, fetal developmental malformations, and immune-mediated cerebellar diseases and of neurodegenerative or early-developing diseases, such as dementia and autism in which cerebellar involvement is an emerging feature. Although these radiological biomarkers appear promising and helpful to better understand ataxia-related anatomical and physiological impairments, to date, very few of them have turned out to be specific for a given ataxia with atrophy of the cerebellar system being the main and the most usual alteration being observed. Consequently, much remains to be done to establish sensitivity, specificity, and reproducibility of available MR and nuclear medicine features as diagnostic, progression and surrogate biomarkers in clinical routine

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
    corecore